Performance assessment of hybrid machine learning approaches for breast cancer and recurrence prediction

Breast cancer is a major health concern and a leading cause of death among women worldwide. Distinguishing malignant tumors from benign ones enables early diagnosis of the disease, so doctors need an accurate method of classifying tumors as malignant or benign. Even if therapy begins immediately after diagnosis, some cancer cells may persist in the body, increasing the risk of recurrence. Metastasis and recurrence are the leading causes of death from breast cancer, so detecting a return of breast cancer early has become a pressing medical issue. Evaluating and contrasting various Machine Learning (ML) techniques for breast cancer and recurrence prediction is crucial to choosing the most effective method. Inaccurate predictions are common when using datasets with a large number of attributes. In response to the limitations observed in existing approaches, this study addresses the need for effective feature selection and optimization by employing Recursive Feature Elimination (RFE) and the Grey Wolf Optimizer (GWO). The methods are evaluated on the Wisconsin Diagnostic Breast Cancer (WDBC) and Wisconsin Prognostic Breast Cancer (WPBC) datasets taken from the UCI-ML repository. In the first step, various preprocessing techniques, including imputation and scaling, are applied to the raw data. In the second step, relevant feature correlations are used with RFE to narrow down candidate discriminative features. In the next step, GWO chooses the combination of attributes that yields the most accurate result. Seven ML classifiers are then applied to both datasets to make a binary decision. On the WDBC and WPBC datasets, the experiments achieved accuracies of 98.25% and 93.27%, precisions of 98.13% and 95.56%, sensitivities of 99.06% and 96.63%, specificities of 96.92% and 73.33%, F1-scores of 98.59% and 96.09%, and AUCs of 0.982 and 0.936, respectively. The hybrid approach's superior feature selection improved the performance of breast cancer and recurrence classification.


Introduction
Breast cancer continues to be a serious public health problem worldwide because it kills so many women. It is the most frequent malignancy in women, with an estimated 2.3 million cases and 685,000 deaths worldwide in 2020 [1,2]. The unchecked growth of carcinogenic, malignant tumors in the breast initiates the local spread of cancerous cells. Women of all ages and ethnicities are more at risk due to increased prevalence rates. Metastasis and recurrence (or relapse) of breast cancer are major causes of mortality. Breast cancer can recur years or even decades after it has been treated. Early detection and diagnosis of breast cancer greatly improve prognosis and survival time for people with this disease. An individual will be subjected to fewer unnecessary operations if the cancerous mass is identified. As a result, research into the most effective method of diagnosing breast cancer has to be prioritized [3,4].
Due to its ability to identify relevant variables in complicated cancer datasets, machine learning (ML) is widely employed in cancer classification and model forecasting. Breast cancer diagnosis and prognosis pose significant difficulties for medical practitioners. ML methods have greatly aided in the early diagnosis of cancer. Applications built on ML methods improve the accuracy of breast cancer diagnosis and prognosis. This has steadily decreased breast cancer mortality over the past two decades [5].
Feature selection plays a crucial role in enhancing the accuracy and efficiency of predictive models, especially in the realm of breast cancer and breast cancer relapse prediction. With the increasing availability of biomedical data, identifying relevant features from a large pool of variables becomes essential to building robust and interpretable models. Feature selection methods aid in selecting a subset of informative features, contributing to the development of accurate and computationally efficient prediction models. In the context of breast cancer research, these methods are pivotal for identifying key biomarkers and risk factors that influence the progression and relapse of the disease. Additionally, optimization techniques play a crucial role in improving the accuracy and efficiency of breast cancer and relapse prediction models. The selection of optimization techniques depends on the characteristics of the dataset, the nature of the problem, and the specific requirements of the prediction model. Combining multiple optimization strategies often leads to improved model performance and robustness. Researchers often leverage various feature selection and optimization techniques to discern meaningful patterns from diverse datasets, improving the precision of predictive models and advancing our understanding of breast cancer dynamics [6,7].

Research gap and motivation
Breast cancer is a disease that affects women of all races, and its incidence increases with age. Among female cancer patients, it accounts for the vast majority of fatalities. Early identification and prediction have been proposed as effective strategies for combating this aggressive cancer. Thus, ML-based recurrence prediction for breast cancer is a pressing medical issue that poses significant hurdles to the scientific community. Predicting how breast cancer will behave is crucial because it helps doctors choose the best course of action for each individual patient, leading to better outcomes. It is also linked to better deployment of healthcare resources for such patients. Both physicians and data scientists agree that elucidating the causes of breast cancer recurrence early is a pressing area of study. Several ML algorithms and statistical approaches have been applied in investigations of cancer recurrence and have led to better breast cancer diagnosis and prediction. Multiple models have been proposed in recent years to predict whether breast cancer will return in the years following surgery; however, all of these models have severe drawbacks. Several attempts have also been made to employ ML techniques to predict a woman's survival after being diagnosed with breast cancer. This is because of proactive breast cancer diagnosis and prognosis and the revolutionary treatment methods currently under development.
Breast cancer and its recurrence prediction have been the subject of extensive research, with a predominant focus on clinical perspectives. However, within the realm of optimization and feature selection methodologies, a distinct gap persists in the current state-of-the-art literature. Existing studies often employ traditional feature selection and optimization methods, such as Recursive Feature Elimination (RFE) and genetic algorithms. While these methods have demonstrated effectiveness, the landscape lacks a comprehensive exploration of hybrid models that integrate RFE, Grey Wolf Optimizer (GWO), and diverse machine learning (ML) algorithms. A critical examination of the literature reveals limited instances where these specific techniques are combined to enhance the predictive accuracy of breast cancer recurrence models. Moreover, the existing literature predominantly highlights the medical aspects of breast cancer recurrence prediction, neglecting an in-depth exploration of the methodology employed. This study endeavors to bridge this research gap by not only providing a medical perspective but also placing a significant emphasis on the methodology employed. The current state of the art lacks an exhaustive analysis of the potential synergies and improvements that can be achieved through the integration of RFE, GWO, and ML algorithms for feature selection, optimization, and classification in the context of breast cancer recurrence prediction. By addressing these gaps, this research aims to contribute to the refinement of methodologies for breast cancer recurrence prediction, offering a more holistic and efficient approach that extends beyond the traditional medical focus.

Research questions
The research questions (RQs) studied are as follows:
RQ1. What is the importance of hybridizing various feature selection, optimization, and classification techniques in disease prediction?
RQ2. Can we achieve 100% accuracy utilizing the hybridization of feature selection and optimization techniques along with ML classifiers on breast cancer datasets?
RQ3. Is the proposed hybrid model able to detect breast cancer as well as its recurrence at an early stage?
RQ4. Does the proposed hybrid approach outperform other existing state-of-the-art models?

Objective
In this paper, a hybrid ML approach is proposed based on Recursive Feature Elimination

Contributions
The contributions of this study are as follows:
• Developed the hybrid ML-based approach employing feature selection, optimization, and classification techniques for breast cancer and its recurrence prediction, obtaining enhanced performance outcomes;
• Implemented iterative RFE and GWO cycles allowing for dynamic feature selection, adapting to changes in data patterns and enhancing the ability to capture the temporal dynamics of breast cancer progression and relapse;
• Integrated RFE with GWO for feature selection, improving the interpretability and efficiency of the classification model;
• Compared and contrasted the proposed hybrid ML-based approach with some of the similar state-of-the-art works, showing the novelty and significance of the study.

Paper structure
This paper is organized as follows: Section 2 discusses the research work being conducted in this field with a summary table. Section 3 presents the datasets employed along with the various techniques adopted in this work. Section 4 covers the study's architectural facet, including the proposed work's design, flow chart, block diagrams, and working principle. Section 5 describes the experimental investigation of the proposed work, in contrast with the related results considered in this study. Section 6 concludes the study with possible extensions to the proposed work.
The critical findings from the considered state-of-the-art works are that most of the research includes only basic ML approaches and considers a small number of performance parameters. Besides, working on imbalanced datasets without proper data pre-processing may not provide good predictive outcomes. Next, ensembling only conventional ML approaches needs a feature selection technique, and integrating feature selection and feature extraction techniques may not be sufficient to achieve improved outcomes. As a consequence, we planned a hybridization of feature selection and optimization on conventional ML approaches to obtain enhanced predictive outcomes.

Materials and methods
The considered datasets are discussed in this section. In addition, the feature selection and optimization techniques employed in this study are briefly discussed. The ML classification approaches used in this study are described in the last subsection.

Datasets employed and pre-processing
Several criteria were set before we began considering data for the experiments in the study. Many researchers in the field of breast cancer use data from the Wisconsin Diagnostic Breast Cancer dataset (WDBC) and the Wisconsin Prognostic Breast Cancer dataset (WPBC), both of which may be found in the UCI Machine Learning Repository [22,23]. Both datasets were obtained from the University of Wisconsin Hospitals. These datasets are highly granular, consisting of features extracted from digitized images. Every feature corresponds to the cell nuclei visible in the image. Table 2 summarizes the available datasets. Each dataset has numerical characteristics or properties associated with each sample or classification pattern.
The WDBC dataset is a compact collection of information extracted from digitized images. It contains 569 records, each with 32 attributes (ID, diagnosis, and 30 real-valued variables). The data set is linearly separable using all 30 input features. Every feature describes characteristics of the cell nuclei visible in the image. The first attribute is the patient's identifier, and the second is a label indicating whether the tumor is malignant or benign. The attributes computed for each cell nucleus occupy fields 3-32 [24]. The radius is the mean distance from the center to the points on the perimeter. The texture is the standard deviation of the grayscale values. The smoothness characterizes the local variation in radius lengths. The compactness is computed as perimeter^2 / area - 1.0. Concavity describes the degree to which the contour is concave, and the fractal dimension equals the coastline approximation - 1. The mean, standard error, and worst (largest) value are calculated for each of the ten characteristics, giving 30 features. For instance, field 3 holds the mean radius, field 13 the standard error of the radius, and field 23 the worst radius. The WDBC features are therefore organized in three groups of columns (mean, standard error, and worst), one per characteristic.
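The field layout described above can be made concrete with a short sketch. The column names below are hypothetical labels chosen for illustration (the raw data file carries no header), and the compactness helper assumes the perimeter^2 / area - 1.0 definition given in the dataset documentation.

```python
# Ten nucleus measurements, in the order documented for the Wisconsin datasets.
BASE_FEATURES = [
    "radius", "texture", "perimeter", "area", "smoothness",
    "compactness", "concavity", "concave_points", "symmetry",
    "fractal_dimension",
]
STATISTICS = ["mean", "se", "worst"]  # three summary statistics per measurement


def wdbc_columns():
    """Return 32 illustrative WDBC field names; list index 0 is field 1."""
    columns = ["id", "diagnosis"]
    for stat in STATISTICS:
        columns.extend(f"{name}_{stat}" for name in BASE_FEATURES)
    return columns


def compactness(perimeter, area):
    """Compactness of a nucleus contour: perimeter^2 / area - 1.0."""
    return perimeter ** 2 / area - 1.0


cols = wdbc_columns()
print(len(cols))                    # number of fields per record
print(cols[2], cols[12], cols[22])  # fields 3, 13 and 23
```

Because the three statistic groups are laid out one after another, the same measurement recurs every ten fields, which is why fields 3, 13, and 23 all describe the radius.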
The type and progression of breast cancer both affect the outlook for survival. The WPBC dataset comprises 198 observations, of which 47 are recurrences and 151 are non-recurrences. Like the other collections, the WPBC dataset includes both healthy and cancerous samples. Its attributes are as follows. The first is the patient's unique ID. The second is the output class: R for "recurrence" and N for "non-recurrence". The third is time: the time to recurrence for class R and the disease-free time for class N. Attributes 4-33 are real values computed for the cell nuclei from ten characteristics: radius, area, perimeter (dimensions and shape of a nucleus), concavity, concave points, symmetry, fractal dimension (approximation of a coastline), compactness, texture (standard deviation of grayscale values), and smoothness (local variation in radius lengths). The 34th feature is tumor size, measured in centimeters. Tumors fall into four size categories: T-1 is less than two centimeters, T-2 is between 2 and 5 centimeters, T-3 is longer than five centimeters, and T-4 is any tumor that has ulcerated the skin or is attached to the chest wall. The 35th feature is lymph node status, the number of axillary lymph nodes in which cancer was found during surgery. Axillary lymph nodes, located in the armpit, are a primary site of metastasis for breast cancer. Lymph node status is missing in four records [25].
Data pre-processing is essential for any classification system so that ML models can make sense of their input. Using data preprocessing, we clean up the data set so that only accurate information is delivered. For optimal classification results, the data must be complete, accurate, and free of ambiguity. Errors and gaps in the data set can be remedied through preprocessing, which improves the quality of the dataset and yields clean, model-ready data. The dataset included redundant and irrelevant information because it was compiled from many sources, so we employ data cleansing methods to remove such inconsistencies. Several pre-processing methods were applied to the breast cancer datasets before classification tasks were performed using ML methods: duplicates, missing values, and unnecessary attributes were eliminated. These procedures are necessary for getting the dataset ready for use with machine learning models, and removing unnecessary data characteristics improves performance. The preprocessing technique consists of several stages, each of which is described in turn below.
During data cleaning, noise in the data is reduced and missing values are handled. Handling missing values: the dataset used in this study [20] was analyzed for missing entries. The WPBC dataset has some missing and irrelevant data while the WDBC dataset does not, so we clean the data by substituting appropriate values for the missing ones. One attribute value (represented by "?") is missing in four WPBC instances. Imputation supplies the missing values of this attribute for the instances of a given class. Removing outliers: an outlier is a value that does not fit in with the rest of the data, and outliers can be particularly harmful because they significantly affect a model's predictions. To identify whether an outlier record is the consequence of a data collection error or a unique event to be taken into account during data processing, researchers typically examine the records in question. Removing outliers may result in a smaller dataset overall, but one that is nonetheless accurate. Removing irrelevant attributes: statistical correlation analysis is used to eliminate superfluous attributes. An irrelevant attribute common to WPBC and WDBC is the 'Sample code number', which is disregarded because it does not influence the classification process. Data normalization: starting the training process with scale-normalized features shortens the total training time. Normalization aims to make the values of the features more comparable to one another.
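As a rough illustration of the imputation and normalization steps described above, the sketch below fills "?" entries with the mean of the observed values and then rescales a column to [0, 1]. Both choices (mean imputation and min-max scaling) are assumptions made for illustration; the text does not state which imputation rule or scaler was used, and the sample values are toy data rather than WPBC records.

```python
def impute_missing(column, missing="?"):
    """Replace missing markers with the mean of the observed values."""
    observed = [float(v) for v in column if v != missing]
    mean = sum(observed) / len(observed)
    return [mean if v == missing else float(v) for v in column]


def min_max_scale(column):
    """Rescale values linearly to [0, 1]; a constant column maps to 0.0."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


lymph_nodes = ["5", "2", "?", "0", "1"]  # toy values, not taken from WPBC
filled = impute_missing(lymph_nodes)
scaled = min_max_scale(filled)
print(filled)
print(scaled)
```

In practice the scaling parameters would be estimated on the training split only and then reused on the test split, so that no information leaks from the held-out data.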

Feature selection technique: Recursive Feature Elimination (RFE)
RFE is a method that iteratively selects features based on their statistical significance. The degree of statistical significance (p) is determined using the criteria for hypothesis testing. In hypothesis testing, the p-value is a statistical measure that represents the observed significance of the input feature [26]. If the p-value of a certain input feature is less than the significance threshold ($\rho$), then there is a statistical link between the input and output features. The value of $\rho$ is 0.05 for RFE. The algorithm starts with the dataset (D) having the input feature set $\{f_1, f_2, f_3, \ldots, f_n\}$ and recursively deletes features based on two hypotheses, the null hypothesis ($H_0$) and the alternative hypothesis ($H_a$), as per Eqs (1) and (2), respectively.
Null Hypothesis ($H_0$): The subset of features of the dataset (D) whose statistical significance (p) satisfies the condition in Eq (1) will be removed from the feature set.
Alternative Hypothesis ($H_a$): The feature set having statistical importance as per Eq (2) will be removed from the dataset.
The p-value can be calculated using the logistic regression model for each feature in the feature set of dataset D, using Eq (3) or Eq (4).
Here, $f_i$ is the feature selected for calculating the p-value, $P$ is the probability of the selected feature, $\sigma_0$ and $\sigma_i$ are the logistic regression parameters, and $m$ is the value of the selected feature $f_i$.
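Eqs (1)-(4) are referenced above but not reproduced in this text. One plausible reconstruction is sketched below, assuming the standard logistic-regression (logit) form with the symbols $P$, $\sigma_0$, $\sigma_i$, $m$, and $\rho$ as defined above. The direction of the inequalities in Eqs (1) and (2) is an assumption inferred from the statement that $p < \rho$ indicates a statistical link; it is not confirmed by the source.

```latex
\begin{align}
H_0 &: \; p \ge \rho \tag{1}\\
H_a &: \; p < \rho \tag{2}\\
\log\!\left(\frac{P}{1-P}\right) &= \sigma_0 + \sigma_i\, m \tag{3}\\
P &= \frac{1}{1 + e^{-(\sigma_0 + \sigma_i m)}} \tag{4}
\end{align}
```

Under this reading, Eqs (3) and (4) are two equivalent forms of the same model: solving Eq (3) for $P$ gives Eq (4).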

Optimization technique: Grey Wolf Optimization (GWO)
Grey Wolf Optimization (GWO) is an algorithm that imitates the social structure and hunting skills of wild grey wolves to discover optimal solutions to optimization problems. GWO is a swarm-intelligence metaheuristic optimization method, much like PSO and GA [27]. The intricate social organization and astute hunting strategies of the grey wolf were the driving forces behind GWO. Grey wolves are usually the most dominant predators in the areas where they live. The average number of grey wolves in a pack is five to twelve. The pack is divided into four sub-groups, α, β, δ, and ω, as shown in Fig 1. The wolf in the α group is the dominant wolf in the pack and guides the others in activities like hunting, moving, and eating. In the absence of the leader wolf from the α group, whether due to illness or death, the strongest of the β wolves takes leadership. The wolves in δ and ω have less influence and power than α and β [28,29]. The sizes of these groups are expressed as Eq (5).
Mathematical model. Grey wolf social structure and hunting strategy (including tracking, encircling, and attacking) are modeled mathematically in this section using the GWO algorithm, as depicted in Fig 2.
Social structure. Eq (6) represents the GWO algorithm's mathematical model of the pack's hierarchical structure. In the GWO algorithm, α and β guide the hunt (the optimization), followed by δ and ω.
Encircling. To hunt the prey, the grey wolves initially encircle it. The encircling phase can be represented by Eq (7).
Here, $\vec{P}_t$ and $\vec{W}_t$ are vectors representing the positions of the prey and the wolf, respectively. $\vec{C}$ is the coefficient vector ranging over [-1, 1], as represented in Eq (10). $\vec{D}$ is the calculated distance used for updating the position of the wolf, and $t$ is the iteration number. The positions of the wolf and the prey can be represented as position vectors (x, y). The new position of the grey wolf is obtained using Eq (8).
Here, $\vec{A}$ is a coefficient vector ranging over [0, 1], which can be calculated using Eq (9).
$\vec{t}_1$ and $\vec{t}_2$ are two random vectors in the range [0, 1], and $\vec{a}$ is a vector whose components decrease linearly from 2 to 0. To represent the hunting behavior of grey wolves, it is assumed that α (the most likely solution), β, and δ know more about where the prey may be lurking.
The algorithm keeps track of the best three solutions it has identified so far and forces the others (the ω wolves) to adjust their location accordingly. The distance vectors for wolves from α, β, and ω can be represented as Eqs (11)-(13). Accordingly, the wolves from each group can be updated as Eqs (14)-(16), respectively.
Attacking. A wolf may follow its prey anywhere within a hypersphere. But that is still not enough to replicate the grey wolf's social intelligence. As noted previously, social hierarchy is crucial to the success of a pack's hunt and its ability to stay alive. The α, β, and ω solutions are taken to be the best for simulating social hierarchy. For simplicity, GWO assumes that there is only one solution for each class, even if, in nature, there may be more than one wolf in each category. Given that α, β, and ω are the best solutions in the population, it is plausible to assume that they know where the global optimum of the optimization problem is. As a result, the other wolves need to revise their positions as per Eq (17).
Exploration and exploitation. When optimizing a task, an algorithm may exhibit both exploratory and exploitative tendencies. To avoid being stuck in a local optimum, the algorithm's exploration phase involves making unexpected modifications to the solutions in an effort to find previously unexplored regions of the problem's search space. Exploitation aims to refine the results of the exploration phase by learning about the area around each solution. Therefore, solutions should be tweaked incrementally to converge to the global optimum. The major problem is that exploitation and exploration often work against one another. As a result, for an algorithm to efficiently estimate the global optimum of a problem, it must take into account and strike a compromise between these competing behaviors during optimization.
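Eqs (7)-(17) are referenced in the description above but are not reproduced in this text. For orientation, the block below gives the update rules of the commonly published GWO formulation, written with this paper's symbols. It is an assumed reconstruction: the equation numbering is mapped from the in-text references, and the third leader is labeled ω to match the wording above, whereas the original GWO literature labels it δ.

```latex
\begin{align}
\vec{D} &= \left|\vec{C}\cdot\vec{P}_t - \vec{W}_t\right| \tag{7}\\
\vec{W}_{t+1} &= \vec{P}_t - \vec{A}\cdot\vec{D} \tag{8}\\
\vec{A} &= 2\,\vec{a}\cdot\vec{t}_1 - \vec{a} \tag{9}\\
\vec{C} &= 2\,\vec{t}_2 \tag{10}\\
\vec{D}_k &= \left|\vec{C}_k\cdot\vec{W}_k - \vec{W}_t\right|,
  \quad k \in \{\alpha,\beta,\omega\} \tag{11--13}\\
\vec{W}'_k &= \vec{W}_k - \vec{A}_k\cdot\vec{D}_k,
  \quad k \in \{\alpha,\beta,\omega\} \tag{14--16}\\
\vec{W}_{t+1} &= \frac{\vec{W}'_\alpha + \vec{W}'_\beta + \vec{W}'_\omega}{3} \tag{17}
\end{align}
```

With $\vec{t}_1, \vec{t}_2 \in [0,1]$ and $a$ decreasing from 2 to 0, these standard forms place the components of $\vec{A}$ in $[-a, a]$ and those of $\vec{C}$ in $[0, 2]$; the ranges quoted in the text may therefore correspond to a different scaling in the paper's own Eqs (9) and (10).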
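The interplay between exploration (large values of a) and exploitation (small values of a) can be seen in a minimal continuous GWO sketch. This is an illustration of the commonly published update rules under stated assumptions, not the authors' implementation: the function name `gwo_minimize`, its parameters, and the sphere test function are hypothetical, and for feature selection the continuous positions would additionally have to be thresholded into a binary feature mask.

```python
import random


def gwo_minimize(fitness, dim, bounds, n_wolves=30, n_iter=200, seed=0):
    """Minimise `fitness` over a box with a basic Grey Wolf Optimizer."""
    rng = random.Random(seed)
    lo, hi = bounds
    wolves = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_wolves)]
    best = list(min(wolves, key=fitness))
    best_fit = fitness(best)
    for it in range(n_iter):
        # The three fittest wolves act as leaders for this iteration.
        leaders = [list(w) for w in sorted(wolves, key=fitness)[:3]]
        a = 2.0 * (1.0 - it / n_iter)  # decreases linearly from 2 towards 0
        new_wolves = []
        for w in wolves:
            pos = []
            for j in range(dim):
                estimate = 0.0
                for leader in leaders:
                    A = 2.0 * a * rng.random() - a  # large |A| favours exploration
                    C = 2.0 * rng.random()          # random weight on the leader
                    D = abs(C * leader[j] - w[j])   # distance to this leader
                    estimate += leader[j] - A * D
                # Average the three leader-guided moves and clip to the box.
                pos.append(min(hi, max(lo, estimate / 3.0)))
            new_wolves.append(pos)
        wolves = new_wolves
        candidate = min(wolves, key=fitness)
        if fitness(candidate) < best_fit:
            best, best_fit = list(candidate), fitness(candidate)
    return best, best_fit


def sphere(x):
    """Toy objective with its global minimum of 0 at the origin."""
    return sum(v * v for v in x)


best, best_fit = gwo_minimize(sphere, dim=3, bounds=(-5.0, 5.0))
print(best_fit)
```

Early iterations allow coefficients with magnitude above 1, so wolves can overshoot the leaders and explore; as a shrinks, every move contracts toward the average of the three leaders, which is the exploitation phase.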

Classification techniques employed
The selection of seven ML methods in this study for breast cancer and breast cancer relapse prediction reflects a deliberate strategy to harness the strengths of diverse algorithms and enhance the robustness of our predictive models. The likelihood of breast cancer recurrence can be predicted using one of several different classification methods [30]. ML and statistical approaches classify patients into benign and malignant or relapse and non-relapse groups using their medical histories, genetic profiles, and clinical data. Seven ML classification methods are applied to the datasets in combination with RFE and GWO: Naive Bayes (NB), K-Nearest Neighbors (KNN), Logistic Regression (LR), Support Vector Machine (SVM), Multilayer Perceptron (MLP), Random Forest (RF), and Decision Tree (DT) [31,32]. These seven ML methods were chosen to provide a comprehensive exploration of different modeling paradigms. NB is a probabilistic classifier based on conditional probability, KNN considers local data neighborhoods, LR offers simplicity and interpretability, SVM excels in handling complex relationships, MLP captures intricate patterns, RF provides robustness against overfitting, and DT offers interpretability. These diverse methods allow us to account for various aspects of the complex breast cancer landscape, ensuring a more holistic understanding and accurate prediction of breast cancer outcomes.

Proposed model
The reported model uses two types of breast cancer relapse datasets. Fig 3 shows the workflow of the proposed model. Initially, the datasets undergo a preprocessing step to handle the outliers present in them. The RFE feature selection algorithm is applied to the processed dataset to identify the correlated features. The GWO optimization algorithm is then applied to the resulting dataset to bring the optimized number of features to the fore without hampering the dataset's utility. Finally, seven different ML classifiers are applied to evaluate the performance.
• Preprocess the raw dataset to obtain D
• Apply RFE to the preprocessed dataset D
• Define $H_0$ and $H_a$
• For $i \leftarrow 1$ to $k$
• Find the p-value of $f_i$ as $\log\left(\frac{P}{1-P}\right)$
• Determine the suitable hypothesis for the selected feature
• Update the optimal features in D' to obtain the optimal dataset D''
• End for
• Apply ML classifiers to D'' for calculating the evaluative parameters.
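The elimination loop in the steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: `eliminate_features` and `p_value_fn` are hypothetical names, the p-values are supplied by the caller (here from a fixed toy table) rather than fitted by logistic regression, and features are dropped when their p-value is at or above the threshold ρ = 0.05 quoted in the text.

```python
def eliminate_features(features, p_value_fn, rho=0.05):
    """Repeatedly drop the least significant feature until all have p < rho.

    p_value_fn(feature, remaining) is a caller-supplied (hypothetical) function
    returning the p-value of `feature` given the features still in play.
    """
    remaining = list(features)
    while remaining:
        p_values = {f: p_value_fn(f, remaining) for f in remaining}
        worst = max(remaining, key=lambda f: p_values[f])
        if p_values[worst] < rho:
            break  # every remaining feature is significant
        remaining.remove(worst)
    return remaining


# Toy p-values fixed in advance purely for illustration (not from the paper).
TOY_P = {"radius": 0.001, "texture": 0.03, "symmetry": 0.20, "smoothness": 0.60}

selected = eliminate_features(TOY_P, lambda f, rest: TOY_P[f])
print(selected)
```

Recomputing the p-values on every pass matters in a real model, because removing one feature changes the fitted coefficients of the others; the fixed toy table above hides that effect for the sake of brevity.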

Results and discussion
Several assumptions are made in evaluating the proposed ML-based hybrid approach. The feature selection technique RFE and the optimization technique GWO were applied to seven conventional ML techniques to build novel ML-based hybrid approaches with enhanced evaluative measures [33,34]. The proposed system was tested on a workstation equipped with 8GB of RAM, a 500GB SSD, a 1TB HDD, a 3.6GHz Intel Core i5 CPU, and Ubuntu 20.04. An extensive empirical study of the gathered results should be a part of any planned undertaking. Through a methodical experimental procedure, these measures are derived from a confusion matrix of actual versus predicted classes. True positives and true negatives are represented by $T_A$ and $T_B$ in the confusion matrix, whereas false positives and false negatives are represented by $F_A$ and $F_B$ [35-37]. Performance metrics for classification in this study include Accuracy ($A_C$), Misclassification Rate ($M_R$), Precision ($P_R$), Sensitivity ($S_N$), Specificity ($S_P$), F1-Score ($F_S$), False Negative Rate ($F_{NR}$), False Positive Rate ($F_{PR}$), Matthews Correlation Coefficient ($M_{CC}$), and Balanced Accuracy ($B_A$). Detailed formulations for these metrics are provided in Eqs (18)-(27).
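The ten metrics listed above follow from the four confusion-matrix counts by their standard textbook definitions, sketched below. The function name and dictionary keys are hypothetical. The example counts are also hypothetical: they were chosen only because they reproduce the WDBC percentages reported in this paper, and the paper's actual confusion matrix is not given in this text.

```python
import math


def classification_metrics(t_a, t_b, f_a, f_b):
    """t_a/t_b: true positives/negatives; f_a/f_b: false positives/negatives."""
    total = t_a + t_b + f_a + f_b
    accuracy = (t_a + t_b) / total
    precision = t_a / (t_a + f_a)
    sensitivity = t_a / (t_a + f_b)
    specificity = t_b / (t_b + f_a)
    mcc_den = math.sqrt((t_a + f_a) * (t_a + f_b) * (t_b + f_a) * (t_b + f_b))
    return {
        "accuracy": accuracy,
        "misclassification_rate": 1 - accuracy,
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1_score": 2 * precision * sensitivity / (precision + sensitivity),
        "false_negative_rate": f_b / (f_b + t_a),
        "false_positive_rate": f_a / (f_a + t_b),
        "mcc": (t_a * t_b - f_a * f_b) / mcc_den,
        "balanced_accuracy": (sensitivity + specificity) / 2,
    }


# Hypothetical counts consistent with the reported WDBC percentages.
metrics = classification_metrics(t_a=105, t_b=63, f_a=2, f_b=1)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Balanced accuracy and MCC are the most informative entries when the classes are imbalanced, as in WPBC, because plain accuracy can stay high even when specificity is poor.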

Results analysis on WDBC dataset
The various ML-based hybrid approaches considered in this study employ the feature selection technique RFE, the optimization technique GWO, and seven conventional ML techniques: NB, KNN, LR, SVM, MLP, RF, and DT. First, we applied these hybrid approaches to the WDBC dataset. As shown in Table 3, the hybrid approach "RFE+GWO+MLP" outperforms all other six suggested hybrid approaches with an accuracy of 98.25%, precision of 98.13%, sensitivity of 99.06%, specificity of 96.92%, F1-score of 98.59%, etc. Besides, this hybrid ML approach also provides comparatively better outcomes for the other evaluative parameters considered in this work, including misclassification rate, FNR, FPR, MCC, and balanced accuracy, as shown in Table 3, which leads us to consider it the recommended hybrid model for the WDBC dataset. The ROC curve and the AUC value for this recommended hybrid approach are depicted in Fig 14. The obtained AUC value of 0.982 supports the significance of the proposed approach on the WDBC dataset.
On the WPBC dataset, with an accuracy of 93.27%, precision of 95.56%, sensitivity of 96.63%, specificity of 73.33%, F1-score of 96.09%, etc., the "RFE+GWO+MLP" hybrid approach is clearly superior to the other six suggested hybrid approaches. Furthermore, as shown in Table 4, this hybrid ML approach also provides improved outcomes for the other evaluative parameters considered in this work, such as misclassification rate, FNR, FPR, MCC, and balanced accuracy.

Comparative analysis
To show the novelty and significance of the proposed ML-based hybrid approach, we have added a comparative analysis. Tables 5 and 6 compare the proposed ML-based hybrid approach with the considered state-of-the-art works on the WDBC and WPBC datasets, respectively, in terms of accuracy, precision, specificity, sensitivity, F1-score, and AUC. The proposed work is found to be superior to some existing works and inferior to others on several evaluation parameters in both datasets, as depicted in Tables 5 and 6. Although the proposed work falls slightly short of similar existing works on the WDBC dataset (Table 5), it outperforms similar existing works on the WPBC dataset (Table 6).

Conclusion and future scope
The ML-based hybrid approach suggested here makes use of two breast cancer datasets: WDBC and WPBC. This investigation used RFE and GWO to further analyse and clarify the raw data. First, both datasets undergo preliminary data processing, including imputation, scaling, and other methods. Second, RFE selects the most pertinent features from the training datasets in order to accurately predict the target variable. GWO then determines the most effective combination of the selected features for an accurate prediction. Using an 80/20 split, we examined the effectiveness of the proposed method. The proposed hybrid technique selected features and improved breast cancer and recurrence classification accuracy. The experiments have shown accuracies of 98.25% and 93.27% on the WDBC and WPBC datasets, respectively; precisions of 98.13% and 95.56%; sensitivities of 99.06% and 96.63%; specificities of 96.92% and 73.33%; F1-scores of 98.59% and 96.09%; and AUCs of 0.982 and 0.936. [...] effectiveness of the machine learning algorithms. We have also compared and contrasted the proposed hybrid ML-based approach with some of the similar state-of-the-art works, showing the novelty and significance of the study.
Every study has advantages and disadvantages. The use of multiple ML approaches and optimization techniques may lead to increased computational demands. While GWO optimization contributes to model refinement, the effectiveness of the hybrid model depends on the careful tuning of hyperparameters. Sensitivity to hyperparameter choices is a common limitation shared by optimization-based methods, and we highlight the need for thoughtful parameter selection. The results of this investigation can be improved by applying ensemble methods to more breast cancer and breast cancer recurrence datasets with unique characteristics.

Fig 3. Proposed work block diagram. https://doi.org/10.1371/journal.pone.0304768.g003

Algorithm 1:
• [...] vector $\vec{D}$
• Determine the position of the search agent $W_i$
• Calculate the fitness function
• End for
• Update $a$, $A$, $C$
• Find the best solution based on the fitness function
• Update the next position of $W_\alpha$, $W_\beta$, and $W_\omega$ as [...]

Fig 13. Recorded balanced accuracies in % for the hybrid ML approaches on the WDBC dataset. https://doi.org/10.1371/journal.pone.0304768.g013

Seven traditional ML methods (NB, KNN, LR, SVM, MLP, RF, and DT) are combined with the feature selection method RFE and the optimization technique GWO to generate the various ML-based hybrid approaches addressed in this paper. The second part of this research involved using these hybrid methods on the WPBC dataset. In Table 4, we present the findings of in-depth studies of the effectiveness of the proposed ML-based hybrid techniques. Results for $A_C$, $M_R$, $P_R$, $S_N$, $S_P$, $F_S$, $F_{NR}$, $F_{PR}$, $M_{CC}$, and $B_A$ are displayed graphically in Figs 15-24. Fig 25 depicts the ROC curve and the AUC value for the suggested hybrid technique. The significance of the proposed method on the WPBC dataset is supported by the AUC value of 0.936 obtained using the suggested hybrid approach.

Fig 24. Recorded balanced accuracies in % for the hybrid ML approaches on the WPBC dataset. https://doi.org/10.1371/journal.pone.0304768.g024

Random Forest (RF) and Decision Tree (DT). The various experiments are performed on the Wisconsin Diagnostic Breast Cancer (WDBC) and Wisconsin Prognostic Breast Cancer (WPBC) datasets sourced from the open-access University of California, Irvine Machine Learning (UCI-ML) repository, considering ten evaluative measures.

Table 6. Comparative analysis of the proposed hybrid approach and the considered state-of-the-art works on the WPBC dataset.
https://doi.org/10.1371/journal.pone.0304768.t006